Journals
  Publication Years
  Keywords
Search within results Open Search
Please wait a minute...
For Selected: Toggle Thumbnails
Key information extraction algorithm of news Web pages
XIANG Jingjing, GENG Guanggang, LI Xiaodong
Journal of Computer Applications    2016, 36 (8): 2082-2086.   DOI: 10.11772/j.issn.1001-9081.2016.08.2082
Abstract632)      PDF (888KB)(597)       Save
Since information extraction algorithm for Web pages lacks generality and information of title, release-time and source in news Web page, a new information extraction algorithm was proposed to resolve those problems. Firstly, HTML code of Web page was parsed to text sets combined with line number and text; then, extractor began to search boundary of news content from line which the longest sentence belonged to due to the characteristic that the longest sentence belongs to the content of news with an extremely high probability. Meanwhile, the longest common string algorithm was used to extract title, the regular expression and line number were used to extract release-time, and the presentation characteristics of source and line number were used to extract source. Finally, a data set was built to conduct a comparison experiment with an open-source software named newsPaper in accuracy of extraction. Experimental results show that newsExtractor outperforms newsPaper in average accuracy of content, title, release-time and source, it has strong generality and robustness.
Reference | Related Articles | Metrics
Mobile terminal positioning method driven by road test data
YUAN Guangjie, LI Xiaodong, JIANG Zhaoyi, YUAN Peng, GUO Zhiwei
Journal of Computer Applications    2016, 36 (12): 3515-3520.   DOI: 10.11772/j.issn.1001-9081.2016.12.3515
Abstract883)      PDF (979KB)(325)       Save
The current wireless positioning technology can not adapt to complex environment and has low positioning accuracy. In order to solve the problems, a mobile terminal positioning method driven by road test data was proposed. Firstly, based on the location algorithm of base station and the description algorithm of base station signal coverage, the location-coverage model of base station base was established. By matching the initial parameters of the mobile terminal with the model base, the initial range of the mobile terminal was obtained. Secondly, the road classification database was established based on the extraction algorithm of road feature, and the wireless signal feature matching algorithm was used to match the road information of the mobile terminal. Finally, the model base of longitude-latitude and intensity mapping was established and the precise position of the mobile terminal was determined by using the terminal signal comparison algorithm. The theoretical analysis and experimental results show that the probability of 2 m localization accuracy of the base station reaches 60%, the probability of 3 m reaches 77%, which are improved respectively by about 39% and 12% than those before whitening, and the description algorithm of base station signal coverage can also describe the coverage of base station signal more accurately. The accuracy improvement of the two parts can improve the final positioning accuracy.
Reference | Related Articles | Metrics